FraunhoferFraunhoferICNAPICNAP

ICNAP Login

The image displays a flowchart and tables related to data quality assessment for machine learning. It has a black background with multiple sections connected by arrows. At the top, there are two tables indicating the upload process for datasets, featuring columns for "File Type," "Observations," and "Features." Below, a section titled "Quantification of the criteria" outlines criteria for cross-sectional and timeseries data. A color-coded table displays quality criteria across four categories: Dataset, Data Points, Feature, and Model, assigned scores ranging from 0 to 1. The score ranges are highlighted in red, yellow, and green. A total score of 0.85 is indicated, alongside steps for weighting criteria, creating overall ratings, and visualizing outputs. Each section is labeled with distinct categories and includes numerical values and weights.

Data Quality Assessment


Summary

We have developed a data quality assessment tool that enables users with minimal data expertise to evaluate the quality of their production data quickly. The tool assesses datasets based on 41 criteria across four categories, resulting in an overall quality score with adjustable weights. It also offers explanations, visualizations, and improvement suggestions. This system simplifies the evaluation process, aids decision-making, and helps identify valuable machine learning use-cases.

Topic Fields
Data Analytics
Published2022
Involved Institutes
Project TypeICNAP Research/Transfer Project
Responsibles

© Fraunhofer 2025

ContactTerms of useData ProtectionEditorial Notes